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The insect midgut epithelium is generally lined with a unique 
chitin and protein structure, the peritrophic membrane (PM), 
which facilitates food digestion and protects the gut epithe- 
lium. We used gel electrophoresis and mass spectrometry to 
identify the extracted proteins from the silkworm PM to obtain 
an in-depth understanding of the biological function of the 
silkworm PM components. A total of 305 proteins, with mo- 
lecular weights ranging from 8.02 kDa to 788.52 kDa and the 
isoelectric points ranging from 3.39 to 12.91, were success- 
fully identified. We also found several major classes of PM 
proteins, i.e. PM chitin-binding protein, invertebrate intestinal 
mucin, and chitin deacetylase. The protein profile provides a 
basis for further study of the physiological events in the PM of 
Bombyxmori. [BMB Reports 2012; 45(11): 665-670] 



INTRODUCTION 

The peritrophic membrane (PM) is an invertebrate-unique 
semi-permeable structure that lines the midgut of an insect. The 
presence of this matrix in insects has been recognized for over 
two centuries. Lyonet (1) found a sheath encasing the food bolus 
in lepidopteran larvae in 1972. Balbiani (2) was the first to refer 
to this matrix as PM in 1 980. The PM is mainly composed of chi- 
tin nanofibers embedded in the matrix of proteins, glycoproteins, 
and proteoglycans. Wang and Granados (3) provided the first 
molecular model of the lepidopteran PM. However, the func- 
tions and the chemical components of the matrix are still 
rudimentary. Sudha and Muthu (4) reported on the high abrasion 
in the midgut epithelium and the membranous bodies of an un- 
usual Bombyx mori mutant that lacks a PM in the larval stage 
caused by food in 1988. Over the past decade, the development 
of molecular, biological, and biochemical approaches improved 
remarkably the awareness of the PM functions. The main bio- 
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logical functions of PM include the spatial organization of diges- 
tion, protection from ingested toxins, and serves as a physical 
barrier to pathogens. 

PM proteins have at least two chitin-binding domains that are 
presumed to function in the cross-linking of the chitin fibrils. The 
peritrophins-44 and peritrophins-48 are the first integral PM gly- 
coproteins reported from the larvae of Lucilia cuprina. These 
proteins contain a signal peptide, followed by five sequential 
CBDs, conforming to the CX1M0CX5-6CX9-19CX10-14CX4-14C peri- 
trophin-A consensus. This multiple cysteine-rich domain enables 
the protein to bind with chitin, the major constituent of PM (5, 
6). Wang and Granados (7) identified an invertebrate intestinal 
mucin (MM) from Trichoplusia ni that exhibits strong association 
with chitin and could degrade by baculovirus enhancement, but 
is highly resistant to proteolytic attack from endogenous midgut 
proteases. Ferreira et al. (8) found two peritrophins from 
Tenebrio molitor and Spodoptera frugiperda by cDNA library 
screenings with antibodies in 2007. The two peritrophins are im- 
mobilized on the midgut cell surface and concur in PM 
formation. A downregulated chitin deacetylase-like protein se- 
lected from a group of expression changes the Helicoverpa armi- 
gera genes after infection with H. armigera single nucleopolyhe- 
drovirus, which might reduce the susceptibility of this bollworm 
to baculovirus by decreasing its PM permeability (9). 

Various proteomic approaches has been utilized recently to 
identify the proteins from the PM of various insects. Ramos et al. 
(10) used a two-dimensional gel electrophoresis to analyze the 
black fly type I PM and found two PM1 -specific proteins (66 and 
61 kDa). Lehane et al. (11) showed that the PM contains a range 
of proteins, most of which require relatively harsh treatment for 
their solubilization. Moskalyk et al. (12) found approximately 20 
major proteins from the Aedes aegypti PM and approximately 40 
major proteins from the Anopheles gambiae PM through 2-D gel 
electrophoresis and lectin-binding affinity assays. Shotgun pro- 
teomics is a remarkably powerful technology for identifying 
complex samples of proteins on a global level. Two separate 
studies have recently focused on shotgun proteome analysis of 
PM. Forty-one proteins were identified from the PM in H. armi- 
gera using 1D-LC-MS/MS (13). Dinglasan et al. (14) described 
the complete PM proteome of the A. gambiae and identified 209 
proteins by MudPIT analysis in 2009. 

The PM not only plays important roles in facilitating food di- 
gestion and providing protection to the gut epithelium, but can 
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also be a significant structural target for insect control. The pro- 
teins of the PM may also provide unique opportunities for the 
control of insect pests and vector borne diseases. The silkworm, 
Bombyx mori, is not only a domesticated insect for silk pro- 
duction, but is also a model lepidopteran insect for pest control. 
Therefore, we used the shotgun proteome technology to identify 
comprehensively the proteins from the PM of silkworm in the 
present study. 

RESULTS AND DISCUSSION 

Overview of proteomic analysis 

We characterized the silkworm larval PM proteins profile using 
the shotgun proteome technology. Several previously reported 
proteins detected in other insect studies were observed in the 
present study. However, the limited number of lepidopteran se- 
quences available for PMF (Peptide Mass Fingerprinting) in da- 
tabase searches resulted in either in low coverage and proba- 
bility scores or no significant matches. In the present work, the 
combined databases containing proteins sequences downloaded 
from NCBI and from silkDB allowed us to overcome some of 
these limitations and identify more protein homologues for the 
selected proteins. The confidence in the data was controlled by 
searching for the data in target-decoy database to estimate the 
false-positive rate and further validating the data using TPP. 

The protein extracts from the PM of fifth instar actively feeding 
larvae were separated by SDS-PAGE, and the gel was cut into 7 
slices for the LOMS/MS analysis (Fig. 1A). The obtained 7 raw 
data sets were searched against a combined in-house database 
according to the procedure in the Materials and Methods sec- 
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Fig. 1. Protein Profiles of the silkworm PM in the 5th day of the 
fifth instar. (A): 1 D SDS-PAGE pattern of PM proteins extracted 
from the 8. mori. Each gel lane was cut into 7 bands (PM1 to 
PM7) followed by in-gel digestion used for shotgun analysis. The 
letters 'M' and 'S' represent protein marker and sample, 
respectively. (B): Distribution of isoelectric points (pi) and molec- 
ular weight (MW) for all of the proteins identified by the shotgun 
approach. 



tion, and a total of 305 proteins were identified (Table SI). The 
number of proteins was higher than the previously reported 
number in other insect studies. Approximately 12.45% (38/305) 
of these proteins had a single unique peptide, whereas the others 
had more than one and even up to 145 unique peptides, such as 
the chlorophyllide A binding protein precursor. Approximately 
43.28% (132/305) of the identified proteins had amino acid cov- 
erage of more than 20%. 

The distribution of molecular mass and isoelectric points (pi) 
of the identified proteins is shown in Fig. 1B. The molecular 
weights mostly ranged between 8.02 kDa and 788.52 kDa, ex- 
cept for BGIBMGA006856-PA (2002.29 kDa) and BGIBMGA- 
010471-PA (1538.87 kDa). A total of 79.34% (242/305) of the 
identified proteins were smaller than 100 kDa. The most acidic 
and basic proteins identified by LC-MS/MS had pis of 3.39 and 
12.91, respectively. 

Chitin and chitin-binding proteins 

The presence of multiple chitin-binding domains in PM proteins 
has been suggested to be a mechanism for PM formation. We 
found that several PM proteins have the chitin-binding domains 
of several invertebrate proteins. The BGIBMGA001504-PA is a 
chitin-binding protein, and the chitin-binding domains indicate a 
mechanism to bind chitin in the PM, homologous with CBP1 
and CBP2 of T. ni, which might possess a proteinase-resistant 
mechanism against proteolytic degradation by the digestive pro- 
teinases (15). The mammalian mucins were likely to have a num- 
ber of functions, i.e., lubrication of the passage of food through 
the gut, protection from invasion by bacteria, and protection of 
epithelial cells from digestive proteases (16). Invertebrate in- 
testinal mucin (MM) had many of the molecular properties of 
mammalian mucins. Wang and Granados (7) identified an MM 
from T. ni that also exhibits a strong association with chitin and 
could degrade by a baculovirus enhancement, but is highly re- 
sistant to proteolytic attack from endogenous midgut proteases. 
Chitin deacetylase (CDA) plays an important role in the softening 
of the insect cuticle to allow easier mycelial penetration as well 
as evading lysozyme action in fungi and bacteria. The roles of in- 
sect chitin deacetylases are not well understood. Drosophila 
melanogaster mutants suggest that deacetylase activity increases 
the rigidity of chitin (1 7). HaCDA5a lacks a typical chitin-binding 
domain, but can bind with chitin, and has possible mechanism 
to reduce the susceptibility to baculovirus by decreasing the PM 
permeability (9). 

Toxin binding and immunity 

The aminopeptidase N (APN) serves a variety of functions main- 
ly dependent on its location. This protein works in cooperation 
with endopeptidases and carboxypeptidases in the lepidopteran 
larval midgut to digest proteins in food. APN is also a major Bt 
Cry toxin receptor in the midgut of insects. A 120 kDa APN of 
silkworm binds CrylAa with a 7.6 nM affinity (18). APN also act 
as a mediator of Cry toxin susceptibility to distinguish between 
Cry-binding proteins and proteins that confer Cry toxin 
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susceptibility. The first cadherin-like protein shown to interact 
with Cry toxins, BT-Ri, is a 210 kDa glycoprotein identified in 
Mancluca sexta (1 9). Nagamatsu et al. (20) found a 1 75 kDa gly- 
coprotein, BtR1 75, that was identified as a CrylAa receptor in 6. 
mori. The gene of BtR175 was added to CrylAa-resistant Sf9 
cells in vitro, which made the Sf9 cells susceptible to the CrylAa 
toxin. Alkaline phosphatase (ALP) is also a Cry toxin receptor in 
various insect species. In Heliothis virescens, a membrane- 
bound form of ALP not only could bind to CrylAc, but the ex- 
pression levels are also reduced in the resistant strain, suggesting 
a functional role in toxicity (21). ALP mediates CrylAc suscepti- 
bility and is important in binding toxin in M. sexta (22). A. ae- 
gypti ALP also exhibit properties similar to those of the lep- 
idopteran ALPs (23). 

Several other candidate receptor proteins were identified. 
Serine protease inhibitors not only play important roles in the bi- 
ological immune systems, but can also potentiate the insecticidal 
activity of Cry toxins. They reduce the degradation of Cry or of 
toxin-receptors to enhance the insecticidal activity (24). The 
chlorophyllide A-binding protein (ChBP), identified from the 
midgut of B. mori, can bind to a derivative of chlorophyll (25). 
The ChBP was characterized by binding to CrylAa, Cry1 Ab, and 
CrylAc under nondenaturing conditions, and probably has anti- 
microbial activity, similar to P252 (26). 

Several identified proteins were related with immune 
response. The serine proteases (SP) are a large group of proteo- 
lytic enzymes that participate in the innate immune response in 
insects. Zhao et al. found 65 SP gene expression changes after 
the silkworm was infected by four different microorganisms (27). 
The lipase was isolated from the silkworm digestive juice and 
has been proved to have strong resistance against the BmNPV, 
and the expression was upregulated by the virus attack (28). 



Gram-negative bacteria-binding protein (GNBP) is a highly con- 
served innate immunity molecule, which recognizes bacteria. 
There are three GNBP genes contained in the genome of 
Drosophila, and two of these genes play a role in pattern recog- 
nition in the upstream of Toll signal pathway. GNBP can bind 
with (3—1 and 3-glucan to activate cellar immune response in 8. 
mori (29). 

Functional annotation and classification 

Gene Ontology tools were used to analyze the PM proteome to 
present an overall view on the functional categories of PM 
proteins. The results indicate that 216 of the 305 identified pro- 
teins show at least one matched GO annotation. The proteins 
were classified into cellular component, molecular function, and 
biological process according to the GO hierarchy using WEGO 
(Fig. 2). Majority of the proteins were assigned to the cell and 
cell part in terms of "cellular component." The highest dis- 
tribution was associated with binding and catalytic activity in 
terms of "molecular function." The remaining proteins were 
linked to different activities, such as antioxidation, electron trans- 
port, and enzyme regulation, and among others. The proteins 
were classified according to different categories based on 
"biological process." The highest number of proteins was map- 
ped to proteins involved in metabolic process, followed by pro- 
teins associated with cellular process. 

The PM proteins were mapped to the KEGG ortholog level for 
the KEGG analysis. Two hundred and thirty-four different path- 
ways were linked to the PM. These pathways were classified into 
metabolism (45), genetic information processing (4), environ- 
mental information processing (18), cellular processes (29), or- 
ganismal systems (80), and human diseases (58) (Fig. 3). Among 
the organismal systems, the digestive system was the most 
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Fig. 2. GO categories of the identified 
proteins in the silkworm PM. The 
identified proteins were classified into 
cellular component, molecular function, 
and biological process by WEGO, ac- 
cording to the GO terms. 
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Fig. 3. Categories of commonly re- 
lated pathways in PM according to 
KEGG pathway taxonomy. The path- 
ways were methodically clustered into 
metabolism, genetic information proc- 
essing, environmental information pro- 
cessing, cellular processes, and human 
diseases. 



active. Moreover, five pathways were related to the immune 
system. This phenomenon is likely because the PM not only 
could help the midgut in coordinating the movement of enzymes 
and nutrient uptake, but also plays an important role in insect 
defense strategy. 

The studies on the expression profiles of proteins in the PM 
can facilitate the identification of molecular targets that can be 
used for developing novel and environmentally benign control- 
ling strategies. In the present work, we identified successfully 
305 proteins from silkworm PM in the 5th day of the fifth instar 
by proteomic tools. We have characterized several PM proteins 
based on the proteomic analysis, and a holistic model that re- 
lates the PM structure to its function is presented. The PM pro- 
teins provide a basis for further study of the physiological events 
in the PM of Bombyx mori, and help us to understand the PM 
structure and the role in immune response. 

MATERIALS AND METHODS 

Sample preparation and gel electrophoresis 

The B. mori strain DaZao was used and maintained at State Key 
Laboratory of Silkworm Genomics, Southwest University. The 
silkworm was reared using a mulberry under a 12 h/12 h 
light/dark photoperiod at 26° + 1°C with 70% to 85% relative 
humidity. The silkworms were pricked and dissected gently to 
collect the intact PMs from the 5th day of the fifth instar larval. 
The PMs were washed with Milli-Q water until no food debris 
remained. Approximately 100 PMs were accumulated and stor- 
ed at -80°C. 

The total protein of PMs was extracted in 400 ul lysis buffer 
(containing 2.5% SDS, 10% glycerin, 5% B-mercaptoethanol and 
50 mM Tris-HCI pH 8.8), was homogenized with a tissue grind- 



er, and then vortexed. The PMs were fully disintegrated after in- 
cubation with the lysis buffer, blended several times for 1 h at 
4°C, and then centrifuged for 30 min at 12,000 rpm at 4°C. The 
total protein content in the supernate was determined using the 
method of Bradford (30). The sample was boiled for 10 min, and 
each 100 ug proteins was loaded into three lanes of one-dimen- 
sional (1 D) SDS-PAGE, which was performed by standard meth- 
od using 5% stacking gel and a 15% resolving gel. The electro- 
phoresis was run on the Hoefer SE600 Ruby (Hoefer), and the 
gels were stained by Coomassie Brilliant Blue R250 (Sigma). 

LC-MS/MS analysis 

After visualization by Coomassie staining, each gel lane was man- 
ually cut into 7 bands from top to bottom. The gel bands were 
sliced into smaller pieces and subjected to in-gel tryptic digestion 
as described by Shevchenko et al. (31). Tryptic peptides from gel 
bands were analyzed by LC-MS/MS using a linear ion trap mass 
spectrometer (Finnigan LTQ, Thermo Finnigan). The peptide mix- 
ture was loaded onto a Zorbax 300SB-C18 peptide traps column 
(Agilent Technologies) by the autosampler and were desalted for 
20 min, prior to separation by reverse-phase analytical column 
(0.15 x 150 mm [RP-C18], Column Technology Inc.). The mo- 
bile phases consisted of buffer A (0.1% methanoic acid in water) 
and buffer B (84% ACN, 0.1% methanoic acid in water). The 
peptides were eluted using buffer B linear gradient from 4% to 
50% in 50 min, from 50% to 100% buffer B in 54 min, and buf- 
fer B maintained at 100% in 60 min. The data-dependent acquis- 
ition was performed on the LTQ-Orbitrap mass spectrometer in 
the positive ion mode. The spray voltage was 3.2 kV, and the ca- 
pillary temperature was 200°C. Each full MS scan was followed 
by the MS/MS scans of the 20 most intense peaks in the MS spec- 
trum with dynamic exclusion enabled. The m/z scan range was 
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100 to 2,000 for full mass range. 
Database search 

The databases were constructed using 'silkworm' as a keyword in 
searching from the NCBI (http://www.ncbi.nlm.nih.gov/) protein 
database. In addition, the open reading frames of silkworm were 
obtained from silkworm genome sequences, which were down- 
loaded from silkDB (http://silkworm.swu.edu.cn/silkdb/doc/ 
download.html). The combined databases containing 23,01 7 pro- 
tein sequences (8,394 sequences from NCBI and 14,623 se- 
quences from silkDB) were analyzed using MS/MS data with the 
algorithm of SEQUEST, which is a module of Biowork 3.0 
(Thermo Finnigan) on a local server. The SEQUEST parameters 
were outlined as follows. The mass tolerances of precursor and 
fragmentation ions were set to 1 Da, full tryptic constraint allow- 
ing 1 missed cleavage, fixed modification (carboxamidomethyl) 
on cysteine, and variable modification (oxidation) on methionine. 
The search results were filtered with the following criteria: charge 
+ 1, Xcorr > 1.9; charge +2, Xcorr > 2.2; charge +3, Xcorr > 
3.75; DelCN > 0.1. The datasets were searched against the tar- 
get-decoy database under the same parameters stated above to es- 
timate the false-positive rate. The target-decoy database is a com- 
bination of forward and reverse protein sequences (32). The con- 
fidence was controlled by filtering the initial identifications to 
FDR < 1 % for each sample class. 

The database search results were further validated using 
TPP (v4.4 VUVUZELA rev 1) (http://localhost/tpp-bin/tpp_gui. 
pi), which was downloaded from the website (http://tools. 
proteomecenter.org/TPP.php) and installed with the default 
options. The peptide and protein probability thresholds for 
running PeptideProphet and Protein Prophet were set at 0.8 
and 0.9, respectively. 

Bioinformatics analysis 

Gene Ontology (GO) IDs of the proteins identified in the present 
study were obtained from the Gene Ontology (http://www.ge- 
neontology.org/). GO classifications of these proteins were con- 
ducted using WEGO (http://wego.genomics.org.cn/) following 
the method described by Ye et al. (33). The EC numbers of the 
identified proteins were acquired (if available) with E-value < 
e-15 by using KEGG GENES BLASTP (http://blast.genome.jp/), 
and then were subjected to search against the KEGG reference 
pathway database (http://www.genome.jp/kegg/tool/search_path- 
way.html). The pathways with at least three EC numbers were 
accepted and classified according to the definition of KEGG 
(http://www.genome.ad.jp/kegg/pathway.html). 
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